TAB A 

Docket No.: PF-0619 USN 
USSN: 09/807,452 



Proc. Natl. Acad. Sci. USA

Vol. 93, pp. 10614-10619, October 1996

Biochemistry 



Parallel human genome analysis: Microarray-based expression 
monitoring of 1000 genes 

(Human Genome Project /DNA chip /gene discovery/T cell) 

Mark Schena*†, Dari Shalon*‡, Renu Heller*, Andrew Chai*, Patrick O. Brown§, and Ronald W. Davis*

*Department of Biochemistry, Beckman Center, Stanford University Medical Center, Stanford, CA; ‡Synteni, Palo Alto, CA; and §Department of Biochemistry and Howard Hughes Medical Institute, Beckman Center, Stanford University Medical Center, Stanford, CA

Contributed by Ronald W. Davis, June 26, 1996



ABSTRACT Microarrays containing 1046 human cDNAs 
of unknown sequence were printed on glass with high-speed 
robotics. These 1.0-cm² DNA "chips" were used to quantitatively
monitor differential expression of the cognate human
genes using a highly sensitive two-color hybridization assay. 
Array elements that displayed differential expression patterns 
under given experimental conditions were characterized by 
sequencing. The identification of known and novel heat shock 
and phorbol ester-regulated genes in human T cells demon- 
strates the sensitivity of the assay. Parallel gene analysis with 
microarrays provides a rapid and efficient method for large- 
scale human gene discovery. 



Biology has entered the genome era (1). Complete genome 
sequences for all of the model organisms and human will 
probably be available by the year 2003 (2). Torrents of human 
expressed sequence tags (ESTs) provide a starting point for 
elucidating the function of tens of thousands of cognate genes 
(3). Genome analysis will provide insights into growth, devel- 
opment, differentiation, homeostasis, aging, and the onset of 
diseases (1-3). A detailed understanding of the human genome 
will require the implementation of sophisticated methods for 
gene expression analysis and gene discovery. 

Recently, a microarray-based method for high-throughput
monitoring of plant gene expression was described (4). This
"chip"-based approach involved using microarrays of cDNA
clones as gene-specific hybridization targets to quantitatively
measure expression of the corresponding plant genes (4, 5). A
two-color fluorescence labeling and detection scheme facilitated
sensitive differential expression analysis of different
plant tissues (4, 5). The efficiency of this approach for studies
in higher plants suggested the use of this method for human
genome analysis (4-7). Here, we report the use of cDNA
microarrays for human gene expression monitoring, biological
investigation, and gene discovery.

MATERIALS AND METHODS 

Human cDNA Clones. The cDNA library was made with
mRNA from human peripheral blood lymphocytes transformed
with the Epstein-Barr virus. Inserts >600 bp were
cloned into the lambda vector λYES-R to generate 10 7 -10"
recombinants. Bacterial transformants were obtained by infecting
E. coli strain JM107/λKC. Colonies were picked at
random and propagated in a 96-well format, and minilysate
DNA was prepared by alkaline lysis using REAL preps
(Qiagen, Chatsworth, CA). Inserts were amplified by PCR in
a 96-well format using primers (PAN132, 5'-CCTCTATACTTTAACGTCAAGG;
and PAN133, 5'-TTGTGTGGAATTGTGAGCGG)
complementary to the λYES
polylinker and containing a six-carbon amino modification
(Glen Research, Sterling, VA) on the 5' end. PCR products
were purified in a 96-well format using QIAquick columns
(Qiagen).
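
As a quick sanity check on the two primer sequences, the following sketch computes their length, GC fraction, and a rough melting temperature. The Wallace rule used here (2 °C per A/T, 4 °C per G/C) is an assumption introduced for illustration; the text does not say how the primers were designed. The PAN133 sequence assumes that the garbled character in the source is a C, and the 5' amino modification is ignored.

```python
# Primer sequences as given in the text (5' to 3'); the amino modifier is omitted.
# PAN133 assumes the unreadable base in the source is a C.
PRIMERS = {
    "PAN132": "CCTCTATACTTTAACGTCAAGG",
    "PAN133": "TTGTGTGGAATTGTGAGCGG",
}


def gc_count(seq: str) -> int:
    """Number of G and C bases in the sequence."""
    seq = seq.upper()
    return seq.count("G") + seq.count("C")


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return gc_count(seq) / len(seq)


def wallace_tm(seq: str) -> int:
    """Rough melting temperature in deg C by the Wallace rule (illustrative only)."""
    gc = gc_count(seq)
    at = len(seq) - gc
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.2f}, Tm ~{wallace_tm(seq)} C")
```

The two primers come out close in length and estimated melting temperature, which is what one would expect for a pair used together in a single PCR.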

Microarray Preparation. Amino-modified PCR products
were suspended at a concentration of 0.5 mg/ml in 3×
standard saline citrate (SSC) and arrayed from 96-well microtiter
plates onto silylated microscope slides (CEL Associates,
Houston) using high-speed robotics (4-7). A total of 1056
cDNAs, representing 1046 human clones and 10 Arabidopsis
controls, were arrayed in 1.0-cm² areas. Printed arrays were
incubated for 4 hr in a humid chamber to allow rehydration of
the array elements and rinsed once in 0.2% SDS for 1 min,
twice in H2O for 1 min, and once for 5 min in sodium
borohydride solution (1.0 g of NaBH4 dissolved in 300 ml of
PBS and 100 ml of 100% ethanol). The arrays were submerged
in H2O for 2 min at 95°C, transferred quickly into 0.2% SDS
for 1 min, rinsed twice in H2O, air dried, and stored in the dark
at 25°C.
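
The array density stated above implies a feature spacing that is easy to estimate. The sketch below assumes the 1056 elements are laid out on a uniform square grid that fills the 1.0-cm² area; that layout is an assumption, since the text gives only the element count and the area. The same density is then used to estimate the area a 100,000-element array would occupy.

```python
import math


def center_to_center_um(n_elements: int, area_cm2: float) -> float:
    """Approximate element pitch in micrometers, assuming a uniform square grid."""
    pitch_cm = math.sqrt(area_cm2 / n_elements)
    return pitch_cm * 1e4  # 1 cm = 10,000 micrometers


def area_needed_cm2(n_elements: int, density_per_cm2: float) -> float:
    """Area needed to hold n_elements at a given density (elements per cm^2)."""
    return n_elements / density_per_cm2


if __name__ == "__main__":
    pitch = center_to_center_um(1056, 1.0)
    print(f"Estimated pitch: {pitch:.0f} micrometers")
    print(f"Area for 100,000 elements: {area_needed_cm2(100_000, 1056):.1f} cm^2")
```

Under these assumptions the pitch is roughly 300 micrometers, and an array of 100,000 elements at the same density would need on the order of 95 cm².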

Fluorescent Probes. Tissue mRNAs were purchased
(CLONTECH). Jurkat mRNA was isolated as described by
Schena et al. (4). Probes were made as described (4) with
several modifications. The reverse transcriptase used here was
Superscript II RNase H- (GIBCO). The Cy5-dCTP was
purchased from Amersham. Each reverse transcription reaction
contained 3.0 μg of total human mRNA. Arabidopsis
control mRNAs were made by in vitro transcription of cloned
HAT4, HAT22, and YesAt-23 cDNAs (4, 8, 9) using an RNA
Transcription Kit (Stratagene). For quantitation, the mRNAs
were doped into the reverse transcription reaction at ratios of
1:100,000, 1:10,000, and 1:1000 (wt/wt), respectively. Following
the reverse transcription step, samples were treated with 2.5 μl
of 1 M sodium hydroxide for 10 min at 37°C, then neutralized
by adding 2.5 μl of 1 M Tris·HCl (pH 6.8) and 2.0 μl of 1 M
HCl. Probe mixtures contained cDNA products derived from
3 μg of total mRNA, suspended in 5.0 μl of hybridization
buffer (5× SSC plus 0.2% SDS).
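
The three Arabidopsis control mRNAs added at known ratios can serve as a calibration curve that converts fluorescence intensity to abundance. The sketch below fits a straight line in log-log space by ordinary least squares. The fitting method and the intensity values are hypothetical and chosen for illustration only; the text states the doping ratios but not how the calibration was computed.

```python
import math

# Known abundances (wt/wt) of the three spiked controls, from the text.
SPIKE_ABUNDANCE = {"HAT4": 1e-5, "HAT22": 1e-4, "YesAt-23": 1e-3}

# Hypothetical background-corrected intensities (arbitrary units), for illustration.
SPIKE_INTENSITY = {"HAT4": 50.0, "HAT22": 500.0, "YesAt-23": 5000.0}


def fit_loglog(intensities, abundances):
    """Least-squares fit of log10(abundance) = slope * log10(intensity) + intercept."""
    xs = [math.log10(v) for v in intensities]
    ys = [math.log10(v) for v in abundances]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_abundance(intensity, slope, intercept):
    """Convert an intensity to an estimated abundance using the fitted line."""
    return 10 ** (slope * math.log10(intensity) + intercept)


names = list(SPIKE_ABUNDANCE)
SLOPE, INTERCEPT = fit_loglog(
    [SPIKE_INTENSITY[n] for n in names],
    [SPIKE_ABUNDANCE[n] for n in names],
)

if __name__ == "__main__":
    print(f"slope={SLOPE:.3f}, intercept={INTERCEPT:.3f}")
    print(f"Intensity 200 -> abundance {estimate_abundance(200.0, SLOPE, INTERCEPT):.2e}")
```

Fitting in log space is a natural choice here because the controls span two orders of magnitude in abundance.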

Hybridization and Scanning. Probes were hybridized to
1.0-cm² microarrays under a 14 × 14 mm glass coverslip for
6-12 hr at 60°C in a custom-built hybridization chamber (4-7).
Arrays were washed for 5 min at room temperature (25°C) in
low stringency wash buffer (1× SSC/0.2% SDS), then for 10
min at room temperature in high stringency wash buffer (0.1×
SSC/0.2% SDS). Arrays were scanned in 0.1× SSC using a
fluorescence laser scanning device (4-7) fitted with a custom
filter set (Chroma Technology, Brattleboro, VT). Accurate
differential expression measurements (i.e., final fluorescence
ratios) were obtained by taking the average of the ratios of two
independent hybridizations.



Abbreviation: EST, expressed sequence tag. 

Data deposition: The sequences reported in this paper have been
deposited in the GenBank data base (accession nos. U56654-U56660).
†To whom reprint requests should be addressed. e-mail:
schena@cmgm.stanford.edu.

Cell Culture. Jurkat cells were grown in a tissue culture
incubator (37°C and 5% CO2) in RPMI medium supplemented
with 10% fetal bovine serum, 100 μg of streptomycin per ml,
and 500 units of penicillin per ml. Heat shock corresponded to 
a 4-hr incubation at 43°C. Phorbol ester-treated cells were
grown for 4 hr in the presence of 50 ng of phorbol 12-myristate 
13-acetate (PMA) per ml. 

RNA Blotting. Dot blots were performed as described (4). 

DNA Sequencing. Sequences were obtained using the 
PAN132 and PAN133 primers and a 373A automated
sequencer, according to the instructions of the manufacturer
(Applied Biosystems). 

Computer Graphics and Informatics. Pseudocolor representations
of fluorescence scans were generated using National Institutes
of Health Image software (version 1.52). Software for differential
expression representations was purchased from Imaging Research
(St. Catherines, ON, Canada). Sequence searches were
made to the nonredundant nucleotide data base at the National
Center for Biotechnology Information (NCBI) using Macintosh
BLAST software. The EST data base was accessed via the World
Wide Web (http://www.ncbi.nlm.nih.gov/).

RESULTS 

Gene Discovery and the Heat Shock Response. Microarrays
were used to examine the heat shock response in cultured
human T (Jurkat) cells. Control (37°C) and heat-treated
(43°C) cells were harvested and lysed, and total mRNA from
the two cell samples was labeled by reverse transcriptase
incorporation of fluorescein- and Cy5-dCTP, respectively. In
a second set of labeling reactions, the fluorescent groups were
"swapped" such that the control and heat-treated
samples were labeled with Cy5- and fluorescein-dCTP, respectively.
Each pair of fluorescent probes was hybridized to a
1056-element microarray. The arrays were washed at high
stringency and scanned with a confocal laser scanning device
to detect emission of the two fluorescent groups.

Hybridization signals were observed to >95% of the human
cDNA array elements, but not to any of the Arabidopsis
negative controls (Fig. 1). Fluorescence intensities spanned
more than three orders of magnitude for the 1046 array
elements surveyed (Fig. 1). Comparative expression analysis of
heat shocked versus control cells in the two experiments
revealed 17 array elements that displayed altered fluorescence
ratios. Of the 17
differentially expressed genes, 11 were induced by heat shock
treatment and 6 displayed modest repression (Figs. 1 and 2A).
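
The split into induced and repressed elements can be reproduced from the fluorescence ratios listed in Table 1 for clones B1-B17. The sketch below uses a symmetric 2-fold cutoff (ratio of at least 2.0 for induction, at most 0.5 for repression); that cutoff is an assumption made here, because the threshold for the heat shock experiment is not legible in the source. The B9 ratio is also garbled in the source and is taken as 2.2 for illustration.

```python
from collections import Counter

# Fluorescence ratios (heat shock / control) for clones B1-B17, from Table 1.
# B9 is garbled in the source; 2.2 is an assumed value.
HEAT_SHOCK_RATIOS = {
    "B1": 0.5, "B2": 0.5, "B3": 0.5, "B4": 0.5, "B5": 0.5, "B6": 0.5,
    "B7": 2.0, "B8": 2.0, "B9": 2.2, "B10": 2.4, "B11": 2.4, "B12": 2.5,
    "B13": 2.5, "B14": 2.6, "B15": 4.0, "B16": 5.8, "B17": 6.3,
}


def classify(ratio: float, fold: float = 2.0) -> str:
    """Label a ratio as induced, repressed, or unchanged (cutoff is an assumption)."""
    if ratio >= fold:
        return "induced"
    if ratio <= 1.0 / fold:
        return "repressed"
    return "unchanged"


COUNTS = Counter(classify(r) for r in HEAT_SHOCK_RATIOS.values())

if __name__ == "__main__":
    print(dict(COUNTS))
```

With this cutoff the counts agree with the 11 induced and 6 repressed elements reported in the text.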

To determine the identity of the heat-regulated genes,
cDNAs corresponding to each of the 17 array elements were
sequenced on the proximal and distal end. Data base searches
revealed perfect matches for 14 of the 17 clones, and in each
case proximal and distal cDNA sequences mapped to the same
gene (Table 1). Of the 1046 human genes examined on the
microarray, the five most highly induced in heat-treated cells
were heat shock protein 90α (hsp90α), dnaJ, hsp90β,
polyubiquitin, and t-complex polypeptide-1 (tcp-1) (Table 1). Three
of the 17 clones did not match any entry in the public data base,
though one of the clones (B7) exhibited significant homology
to an EST from Caenorhabditis elegans (Table 1). Each of the
novel sequences (B7-B9) exhibited ~2-fold induction (Table 1)
and relatively low-level expression (Table 2).

To confirm the microarray results, mRNA levels for each of
the genes were measured by RNA blotting. Each of the genes 
that displayed heat shock induction, including the three novel 

Fig. 1. Human gene expression monitored on a microarray. Fluorescent scans represented in a pseudocolor scale correspond to expression levels.
The array contains 10 Arabidopsis controls (upper left corner, elements 1-10) and 1046 human peripheral blood cDNAs. Fluorescent probes were
prepared by labeling mRNA from Jurkat cells grown at 37°C (-Heat Shock, A) or 43°C (+Heat Shock, B). Array elements that display altered
fluorescence intensity (white boxes) corresponded to genes activated (red boxes) or repressed (green boxes) by heat shock. The color bar was
calibrated in separate experiments using known quantities (wt/wt) of Arabidopsis control mRNAs added to the labeling reaction. Microarray rows
(at left) and columns (at the top) are demarcated at 10 element increments (white circles). (Bar = 1 mm.)

Fig. 2. Elemental displays of activated and repressed genes. Fluorescence ratios of two-color microarray scans (Fig. 1) are depicted
schematically. Fluorescein-labeled probes from Jurkat cells subjected to (A) heat shock or (B) phorbol ester treatment were compared with
Cy5-labeled probes from untreated cells. In a second set of reactions, the fluorescent groups were swapped (see text). The data represent the average
of the ratios from two hybridizations, excluding values in which the difference of the two ratios was greater than half the average ratio. The color
bar corresponds to expression ratios, which are independent of the absolute expression level of a given gene.
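
The averaging and exclusion rule described in the Fig. 2 legend can be written out as a short sketch. The function names and the intensity values in the usage lines are hypothetical. The code assumes background-corrected intensities and takes the dye that carries the treated sample as a parameter, so that the swapped labeling reaction is handled by inverting the ratio.

```python
from typing import Optional


def treated_over_control(fluorescein: float, cy5: float, treated_dye: str) -> float:
    """Treated/control ratio for one hybridization, given which dye labels the treated sample."""
    if treated_dye == "fluorescein":
        return fluorescein / cy5
    if treated_dye == "cy5":
        return cy5 / fluorescein
    raise ValueError("treated_dye must be 'fluorescein' or 'cy5'")


def averaged_ratio(r1: float, r2: float) -> Optional[float]:
    """Average two ratios; return None when they differ by more than half their average."""
    mean = (r1 + r2) / 2.0
    if abs(r1 - r2) > 0.5 * mean:
        return None  # excluded as inconsistent between the two hybridizations
    return mean


if __name__ == "__main__":
    first = treated_over_control(400.0, 200.0, treated_dye="fluorescein")
    swapped = treated_over_control(100.0, 240.0, treated_dye="cy5")
    print(averaged_ratio(first, swapped))  # the two ratios agree, so they are averaged
    print(averaged_ratio(1.0, 3.0))        # the two ratios disagree, so the value is dropped
```

Returning None for excluded elements keeps them distinguishable from elements whose ratio is simply close to 1.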



Table 1. Microarray elements corresponding to differentially expressed genes

Clone   Row          Column       Ratio   Blast identity      Accession no.
B1      24           21           0.5     CYC oxidase III     J01415, J01415
B2      1            31           0.5     β-Actin             NR, X00351
B3      15           8            0.5     CYC oxidase III     J01415, J01415
B4      32           19           0.5     CYC oxidase III     J01415, J01415
B5      17           8            0.5     CYC oxidase III     J01415, J01415
B6      [illegible]  31           0.5     β-Actin             NR, X00351
B7      5            4            2.0     Novel*              U56653, U56654
B8      2            19           2.0     Novel†              U56655, U56656
B9      14           5            2.2     Novel†              U56657, U56658
B10     7            8            2.4     Polyubiquitin       X04803, X04803
B11     12           [illegible]  2.4     TCP-1               X52882, X52882
B12     28           [illegible]  2.5     Polyubiquitin       M17597, M17597
B13     14           7            2.5     Polyubiquitin       X04803, X04803
B14     20           9            2.6     HSP90β              M16660, M16660
B15     30           12           4.0     DnaJ homolog        D13388, D13388
B16     10           5            5.8     HSP90α              X07270, X07270
B17     13           16           6.3     HSP90α              M27024, X15183
B18     7            19           2.0     β2-microglobulin    S54761, M30683
B19     21           30           2.1     Novel†              U56659, U56660
B20     3            26           2.2     β2-microglobulin    S54761, M30683
B21     1            18           2.6     PGK                 M11968, L00160
B22     [illegible]  30           3.5     NF-κB1              Z47744, M55643
B23     20           16           19      PAC-1               L11329, L11329


Clone name, array position (Fig. 1), fluorescence ratio, sequence identity, and accession number of cDNAs that manifested
a differential expression pattern with probes prepared from heat shock- (B1-17) or phorbol ester-treated (B18-23) Jurkat cells.
Clones showing >98% identity over 300 nucleotides were assumed to be identical to known sequences. All genes are nuclear
except CYC oxidase III (mitochondrial). Accession numbers reflect the highest score for proximal and distal sequence traces,
respectively. CYC, cytochrome c; TCP-1, T-complex polypeptide; HSP, heat shock protein; PGK, phosphoglycerate kinase;
NF-κB, nuclear factor-kappaB; PAC-1, phosphatase of activated cells; and NR, trace not readable due to the presence of
poly(A)+ tract.

*B7 is 67% identical to an EST from C. elegans (D76026).
†No match in the public data bases.

Table 2. Human gene expression monitored by microarray and RNA blot analyses

Expression levels are given per 10^5 mRNAs.

                                    Microarray             RNA blot
Clone   Blast identity              Level        Ratio     Level        Ratio
B1      CYC oxidase III             92/46        0.5       100/80       0.8
B2      β-Actin                     240/120      0.5       270/280      1.0
B3      CYC oxidase III             36/18        0.5       ND           ND
B4      CYC oxidase III             76/38        0.5       ND           ND
B5      CYC oxidase III             62/31        0.5       ND           ND
B6      β-Actin                     180/89       0.5       ND           ND
B7      Novel (weakly to D76026)    1.3/2.6      2.0       0.77/1.8     2.3
B8      Novel                       2.0/4.0      2.0       1.5/3.4      2.3
B9      Novel                       0.8/1.8      2.2       1.1/1.6      1.5
B10     Polyubiquitin               [illegible]  [illegible]  [illegible]  [illegible]
B11     TCP-1                       [illegible]  2.4       7.1/27       3.8
B12     Polyubiquitin               0.8/2.0      2.5       ND           ND
B13     Polyubiquitin               1.7/4.3      2.5       ND           ND
B14     HSP90β                      75/200       2.6       30/120       4.0
B15     DnaJ homolog                1.0/4.0      4.0       1.6/13       8.1
B16     HSP90α                      0.6/3.5      5.8       3.2/29       9.1
B17     HSP90α                      0.8/5.0      6.3       8.6/62       7.2
B18     β2-microglobulin            1.0/2.0      2.0       5.4/15       2.8
B19     Novel                       1.2/2.5      2.1       [illegible]  [illegible]
B20     β2-microglobulin            2.7/5.9      2.2       ND           ND
B21     Phosphoglycerate kinase     2.4/6.2      2.6       4.7/9.2      2.0
B22     NF-κB1                      1.7/6.0      3.5       0.65/4.7     7.2
B23     PAC-1                       0.5/9.5      19        0.21/15      71

[Table note largely illegible in the source] ... 100,000 (wt/wt) of genes with a microarray (Fig. 1)

sequences, exhibited elevated mRNA levels by dot blot analysis
(Table 2). In all cases, expression ratios as determined by the
two procedures differed by <2-fold for the genes identified in
the heat shock experiments (Table 2). The two assays differed
more widely in terms of assessing absolute expression levels;
nonetheless, absolute expression as monitored on a microarray
typically correlated with RNA blots to within a factor of five
(Table 2).
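
Agreement between the two assays can be expressed as a fold difference between the microarray ratio and the RNA blot ratio for the same clone. The sketch below uses three clones whose Table 2 entries are clearly legible in the source (B14, B16, and B17); the helper name and the choice to divide the larger ratio by the smaller are assumptions made for illustration.

```python
# (microarray ratio, RNA blot ratio) for three heat shock clones, from Table 2.
RATIOS = {
    "B14": (2.6, 4.0),
    "B16": (5.8, 9.1),
    "B17": (6.3, 7.2),
}


def fold_difference(a: float, b: float) -> float:
    """Symmetric fold difference between two positive ratios (always >= 1)."""
    return max(a, b) / min(a, b)


FOLD = {clone: fold_difference(*pair) for clone, pair in RATIOS.items()}

if __name__ == "__main__":
    for clone, value in FOLD.items():
        print(f"{clone}: {value:.2f}-fold difference between assays")
```

For these three clones the two assays agree to within 2-fold, in line with the statement in the text.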

Phorbol Ester Signaling. To explore a signaling pathway
distinct from the heat shock response, microarrays were used
to examine the cellular effects of phorbol ester treatment.
Jurkat cells were treated with phorbol ester, harvested, lysed,
and used as a source of mRNA. Samples of mRNA from
untreated or phorbol ester-stimulated cells were labeled with
reverse transcriptase. The probes were mixed, hybridized to
microarrays, and scanned for fluorescence emission of the two
fluorescent groups. A total of six array elements displayed
≥2.0-fold elevated signals with probes from phorbol ester-treated
cells relative to control samples (Fig. 1B).

To determine the identity of the phorbol ester-induced
genes, clones corresponding to the six array elements were
sequenced. Data base searches revealed perfect matches for
five of the six sequences (Table 1). The two most highly
induced genes were the PAC-1 tyrosine phosphatase and
nuclear factor-kappa B1 (NF-κB1); modest activation was
observed for phosphoglycerate kinase and β2-microglobulin
(Table 1). One remaining clone (B19) did not match any entry
in the public data base (Table 1). B19 displayed a 2.1-fold
induction and, similar to the novel heat shock genes, a relatively
low absolute expression level (Tables 1 and 2). All six of
the phorbol ester-inducible genes displayed increased steady-state
mRNA levels by RNA blotting (Table 2). PAC-1 expression
(Fig. 1; Table 2) defined a detection limit of ~1:500,000
for the assay.

Transcript Imaging in Human Tissues. To determine
whether microarrays could be used to monitor expression in
human tissues, probes were prepared from human bone marrow,
brain, prostate, and heart by labeling each mRNA sample
with Cy5-dCTP. In a separate reaction, a control probe was
prepared by labeling Jurkat mRNA with fluorescein-dCTP.
The four Cy5-labeled probes were each mixed with an aliquot
of the fluorescein-labeled control sample, and the four mixtures
were hybridized to separate microarrays. The arrays were
washed and scanned for fluorescence emission, and hybridization
signals for each of the tissue samples were normalized
to the Jurkat control to generate an expression profile for each
of the 1046 clones present on the array.
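
One way to picture the normalization step is to scale each tissue signal by the Jurkat reference signal on the same array element and then by the known Jurkat expression level. The sketch below is a simplified illustration under stated assumptions: the signals are taken to be background-corrected and balanced between the two channels, the scaling by a known Jurkat level is an assumption rather than the authors' stated procedure, and all numeric values are hypothetical.

```python
def tissue_expression(cy5_tissue: float, fluorescein_jurkat: float,
                      jurkat_level_per_1e5: float) -> float:
    """Estimate tissue expression (per 10^5 mRNAs) relative to the Jurkat reference."""
    return (cy5_tissue / fluorescein_jurkat) * jurkat_level_per_1e5


def expression_profile(tissue_signals: dict, jurkat_level_per_1e5: float) -> dict:
    """Build a per-tissue profile for one array element.

    tissue_signals maps tissue name -> (Cy5 tissue signal, fluorescein Jurkat signal),
    one pair per array, since each tissue was hybridized to a separate microarray.
    """
    return {
        tissue: tissue_expression(cy5, fluor, jurkat_level_per_1e5)
        for tissue, (cy5, fluor) in tissue_signals.items()
    }


if __name__ == "__main__":
    # Hypothetical signals for a single clone on the four arrays.
    signals = {
        "bone marrow": (150.0, 100.0),
        "brain": (80.0, 100.0),
        "prostate": (120.0, 110.0),
        "heart": (300.0, 100.0),
    }
    print(expression_profile(signals, jurkat_level_per_1e5=2.0))
```

Using the Jurkat signal from the same array as the denominator cancels array-to-array differences in spot size and hybridization efficiency, which is the main benefit of a common reference sample.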

Detectable expression was observed for all 15 of the heat
shock and phorbol ester-regulated genes in the four tissue
types examined (Fig. 3). In general, the expression level of each
gene in Jurkat cells correlated rather closely with expression in
the four tissues (Table 2; Fig. 3). Genes encoding β-actin and
cytochrome c oxidase, the two most highly expressed of the 15
genes in Jurkat cells (Table 2), were highly expressed in bone
marrow, brain, prostate, and heart (Fig. 3A). Expression of
cytochrome c oxidase, hsp90α, and the novel B7 sequence was
significantly greater in heart than in the other tissues (Fig. 3).

DISCUSSION 

Many of the heat shock genes identified in this study encode 
factors that function either as molecular "chaperones" 
(HSP90α, HSP90β, DnaJ, TCP-1) or as mediators of protein
degradation (polyubiquitin). The identification of these se- 
quences is consistent with the biochemical basis of heat shock 
induction (10-15). Proteins undergo denaturation at elevated 
temperatures, and those that fail to maintain proper confor- 
mation must be selectively degraded (10-15). It will be inter- 
esting to determine whether the three novel heat shock- 
inducible sequences (B7-B9) mediate protein folding and 
turnover or possess some other biochemical activity. Complete 
nucleotide sequence determination, conceptual translation, 
expression monitoring, and biochemical analysis should pro- 
vide a detailed functional understanding of these genes. 

Fig. 3. Transcript profiles of heat shock and phorbol ester-regulated
genes. Gene expression levels per 100,000 mRNAs (y axes)
are shown for 15 genes (Table 1) in human bone marrow (red), brain
(green), prostate (blue), and heart (yellow). Genes are grouped
according to expression levels (A-C).



Phorbol ester, a potent activator of protein kinase C (16, 17),
induced a set of genes distinct from those involved in the heat
shock pathway. The most highly induced gene identified in this
study, PAC-1, encodes a nuclear tyrosine phosphatase that may play
a role in regulating transcription and cell cycle progression
(18). NF-κB1, a second phorbol ester-inducible gene, is an
intensively studied member of the Rel transcription factor
family (19-21). The Rel proteins are activated by a large
number of stimuli, including phorbol esters, cytokines, bacterial
and viral pathogens, and ultraviolet light (19-21). Modest
activation was observed for three sequences not known to be
inducible by phorbol esters, including phosphoglycerate kinase,
β2-microglobulin, and a novel human gene (B19). Extensive
expression monitoring with microarrays should assist in
understanding how each of these genes integrates into the
highly complex phorbol ester signaling pathway.

It is striking that four novel human genes were discovered 
with an array of 1000 randomly chosen clones, particularly 
because the heat shock and phorbol ester signaling pathways 
have been so intensively studied (10-21). The facile discovery 
of these sequences underscores the fact that microarrays can 
be used for gene discovery in the absence of any sequence 
information. By this approach, clones are chosen at random 
from any library of interest and only those clones that display 
interesting expression patterns are sequenced and character- 
ized. This parallel assay, coupled with a modest DNA sequenc- 
ing facility, allows high-throughput human genome expression 
analysis and gene discovery. 

Genes that are activated or repressed by a given stimulus 
provide functional clues to the cellular pathway involved 
(22-24). Detailed examination of these gene expression "sig- 
natures" can provide a dynamic view of the mode of action of 
a given signaling substance (22-24). Microarrays may thus 
allow rapid mechanistic examination of hormones, drugs, 
elicitors, and other small molecules; moreover, functional 
analysis of transcription factors, kinases, growth factors, cyto- 
kines, receptors, and other gene products should be possible. 
Efforts are underway to develop mRNA amplification strate- 
gies to enable probe preparation from minute tissue samples. 
This capability might allow for high-throughput patient screen- 
ing in a clinical setting. 

The current detection limit of the assay allows monitoring of
transcripts that represent ~1:500,000 (wt/wt) of the total
mRNA. This 10-fold increase in sensitivity compared with the
original report (4) was achieved largely by modifying the
coupling chemistry, which reduced background fluorescence.
The significance of this improvement is considerable in that
approximately half the human genes identified in this study,
including all four novel sequences, exhibited expression levels
below the original detection limit of 1:50,000 (4).
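
The two detection limits can be put on the same scale as the expression levels in Table 2, which are reported per 10^5 mRNAs. The sketch below performs that conversion and checks whether a given expression level would be detectable under each limit. The helper names and the example level of 0.5 per 10^5 are hypothetical.

```python
def limit_per_1e5(one_in: float) -> float:
    """Convert a detection limit of 1:one_in into transcripts per 10^5 mRNAs."""
    return 1e5 / one_in


def sensitivity_gain(new_one_in: float, old_one_in: float) -> float:
    """Fold improvement in sensitivity between two detection limits."""
    return new_one_in / old_one_in


def detectable(level_per_1e5: float, one_in: float) -> bool:
    """True if an expression level (per 10^5 mRNAs) is at or above the detection limit."""
    return level_per_1e5 >= limit_per_1e5(one_in)


if __name__ == "__main__":
    print(f"Original limit: {limit_per_1e5(50_000):.1f} per 10^5 mRNAs")
    print(f"Current limit:  {limit_per_1e5(500_000):.1f} per 10^5 mRNAs")
    print(f"Gain: {sensitivity_gain(500_000, 50_000):.0f}-fold")
    print(detectable(0.5, 50_000), detectable(0.5, 500_000))
```

On this scale the original limit corresponds to about 2 transcripts per 10^5 mRNAs and the current limit to about 0.2, so a gene expressed at an intermediate level would be visible only with the improved assay.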

The ability to detect 2-fold changes in expression was
achieved by the use of two-color fluorescence in the labeling
and detection schemes, digitized data collection, and custom
software. The importance of this capability is underscored by
the fact that nearly all of the genes examined here exhibited
<6-fold changes in expression. The four novel genes, which
showed ≤2.2-fold activation, were probably overlooked in
previous screens that used conventional differential expression
techniques. It may be possible to further improve the precision
of the microarray assay by the use of closely related fluorescent
analogs, such as Cy3 and Cy5, in the labeling and hybridization
reactions.

Microarrays offer a number of advantages over other potential
high-capacity approaches to expression analysis. The
chip-based approach enables small hybridization volumes, high
array densities, and the use of fluorescence labeling and
detection schemes. These features provide a set of performance
specifications that are unattainable with filter-based
approaches (25, 26). The use of cDNA clones provides hybridization
specificity that is not readily attained with oligonucleotide
arrays (27-30). The parallel format of the assay
provides a simultaneous differential expression readout for
>1000 genes. This contrasts with sequencing-based methods,
which require serial data collection for expression analysis (31,
32). A commercial source of cDNA microarrays would greatly
speed the use of a chip-based approach to expression analysis.

The availability of large numbers of ESTs (3) provides a rich
resource of human cDNA clones for microarraying. The
>400,000 ESTs in the public data bases represent a significant
subset of all human genes (3, 33). Microarrays of thousands of
ESTs will provide a powerful analytical tool for future human
gene expression studies. The ~100,000 genes in the human
genome (2, 33) emphasize the need for microarrays of greater
density. Attempts to improve microdeposition techniques are
underway, as [text illegible]
a complete set of human gene targets
(http://cmgm.stanford.edu/~schena/). Microarrays of ~100,000 cDNA elements
would allow expression monitoring of the entire human genome
in a single hybridization. This capacity, coupled with
detailed biochemical analysis of the individual gene products,
would greatly speed the functional analysis of the human
genome.